Reinforcement with iterative punishment

نویسندگان

چکیده

We consider the efficacy of various forms reinforcement learning with punishment in evolving linguistic conventions context Lewis-Skyrms signalling games. show that strategy iterative is highly effective at optimal even complex It also robust and can be easily extended to a self-tuning variety learning. briefly discuss some virtues how it may related nature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Striatal mechanisms underlying movement, reinforcement, and punishment.

Direct and indirect pathway striatal neurons are known to exert opposing control over motor output. In this review, we discuss a hypothetical extension of this framework, in which direct pathway striatal neurons also mediate reinforcement and reward, and indirect pathway neurons mediate punishment and aversion.

متن کامل

Kreitzer Reinforcement , and Punishment Striatal Mechanisms Underlying Movement

Physiol. Soc.. ESSN: 1548-9221. Visit our website at http://www.the-aps.org/. American Physiological Society, 9650 Rockville Pike, Bethesda MD 20814-3991. ©2012 Int. Union Physiol. Sci./Am. the physiological developments. It is published bimonthly in February, April, June, August, October, and December by (formerly published as News in Physiological Science) publishes brief review articles on m...

متن کامل

Asymmetry of reinforcement and punishment in human choice.

The hypothesis that a penny lost is valued more highly than a penny earned was tested in human choice. Five participants clicked a computer mouse under concurrent variable-interval schedules of monetary reinforcement. In the no-punishment condition, the schedules arranged monetary gain. In the punishment conditions, a schedule of monetary loss was superimposed on one response alternative. Devia...

متن کامل

An Iterative Reinforcement Approach for Fine-Grained Opinion Mining

With the in-depth study of sentiment analysis research, finer-grained opinion mining, which aims to detect opinions on different review features as opposed to the whole review level, has been receiving more and more attention in the sentiment analysis research community recently. Most of existing approaches rely mainly on the template extraction to identify the explicit relatedness between prod...

متن کامل

Flexible theft and resolute punishment: Evolutionary dynamics of social behavior among reinforcement-learning agents

Existing models of the evolution of social behavior typically involve innate strategies such as tit-for-tat. Yet, both behavioral and neural evidence indicates a substantial role for learned social behavior. We explore the evolutionary dynamics of two simple social behaviors among learning agents: Theft and punishment. In our simulation, agents employ Q-learning, a common reinforcement learning...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Experimental and Theoretical Artificial Intelligence

سال: 2022

ISSN: ['1362-3079', '0952-813X']

DOI: https://doi.org/10.1080/0952813x.2022.2153272